Metal backend: enable linear with bias by manuelcandales · Pull Request #17115 · pytorch/executorch

manuelcandales · 2026-02-02T21:27:17Z

This pull request introduces a new Metal-specific graph transformation pass for the Apple Metal backend, aimed at decomposing aten.linear operations into simpler operations to improve compatibility and avoid reliance on addmm. It also adds the new pass to the backend's custom passes, exposes it in the package, and introduces a corresponding test module.

Metal Backend Linear Decomposition Pass:

Added a new DecomposeLinearPass in decompose_linear_pass.py, which rewrites aten.linear nodes in the computation graph into a sequence of matmul and add operations. For 2D inputs, it temporarily unsqueezes to 3D to avoid triggering addmm, then squeezes back to 2D, ensuring compatibility with Metal's capabilities.
Registered the new DecomposeLinearPass in the Metal backend's get_custom_passes method, so it is applied during compilation.
Exposed DecomposeLinearPass in the passes module's __init__.py for easier imports and future extensibility.

Testing and Module Registry:

Added a new LinearWithBias module to the module registry in the test suite, providing a model that exercises the new pass and ensuring coverage for linear layers with bias.

[ghstack-poisoned]

manuelcandales · 2026-02-02T21:27:18Z

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]

pytorch-bot · 2026-02-02T21:27:21Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/17115

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 4 New Failures, 7 Cancelled Jobs, 3 Unrelated Failures

As of commit 65e572d with merge base fb59a47 ():

NEW FAILURES - The following jobs have failed:

Lint / lintrunner / linux-job (gh)
>>> Lint for backends/apple/metal/tests/test_modules.py:
pull / unittest / linux / linux-job (gh)
RuntimeError: Command docker exec -t b707283d13a1d11949190137eff5da7d01310bdb5297c1d4535c7b67accc1656 /exec failed with exit code 1
pull / unittest / macos / macos-job (gh)
backends/xnnpack/test/ops/test_conv2d.py::TestConv2d::test_fp16_conv2d
pull / unittest-editable / linux / linux-job (gh)
RuntimeError: Command docker exec -t 72760b1ad0cca31c20b5ffe5a7bf20bb71b1c56231d9c012bf16ef4a5c0ee246 /exec failed with exit code 1

CANCELLED JOBS - The following jobs were cancelled. Please retry:

Test Metal Backend / export-model-metal-artifact (mistralai, Voxtral-Mini-3B-2507, non-quantized) / macos-job (gh)
##[error]The operation was canceled.
Test Metal Backend / export-model-metal-artifact (nvidia, parakeet-tdt, non-quantized) / macos-job (gh)
##[error]The operation was canceled.
Test Metal Backend / export-model-metal-artifact (nvidia, parakeet-tdt, quantized-int4-metal) / macos-job (gh)
##[error]The operation was canceled.
Test Metal Backend / export-model-metal-artifact (openai, whisper-large-v3-turbo, non-quantized) / macos-job (gh)
##[error]The operation was canceled.
Test Metal Backend / export-model-metal-artifact (openai, whisper-small, non-quantized) / macos-job (gh)
##[error]The operation was canceled.
Test Metal Backend / test-executorch-metal-build / macos-job (gh)
##[error]The operation was canceled.
Test Metal Backend / test-metal-backend-modules / macos-job (gh)
##[error]The operation was canceled.

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / test-samsung-models-linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-samsung-quantmodels-linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / unittest-editable / macos / macos-job (gh) (trunk failure)
RuntimeError: Command bash /Users/ec2-user/runner/_work/_temp/exec_script failed with exit code 1

This comment was automatically generated by Dr. CI and updates every 15 minutes.

[ghstack-poisoned]

github-actions · 2026-02-05T18:36:16Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

[ghstack-poisoned]

Copilot

Pull request overview

A pull request that introduces a Metal backend pass to decompose aten.linear operations into simpler operations (matmul + add) to avoid using addmm, which is not implemented in the Metal backend. The PR includes the new pass implementation, integration into the Metal backend, and a test module.

Changes:

Added DecomposeLinearPass that decomposes linear operations into matmul and add operations
Integrated the pass into Metal backend's custom passes
Added a test module LinearWithBias to exercise the new pass

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 4 comments.

File	Description
backends/apple/metal/passes/decompose_linear_pass.py	New pass implementation for decomposing linear operations
backends/apple/metal/passes/init.py	Exposes the new pass in the passes module
backends/apple/metal/metal_backend.py	Registers the new pass in the Metal backend
backends/apple/metal/tests/test_modules.py	Adds test module for linear layers with bias

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-02-05T18:42:19Z

backends/apple/metal/tests/test_modules.py

+
+

Missing closing brace in the MODULE_REGISTRY dictionary definition. The dictionary is opened but not closed, which will cause a syntax error.

Suggested change

}

Copilot · 2026-02-05T18:42:20Z

backends/apple/metal/tests/test_modules.py

+MODULE_REGISTRY["linear_bias"] = {
+    "model_class": LinearWithBias,
+    "input_shapes": [(127, 7)],
+    "description": "Simple linear layer model bias",


The description string is incomplete and grammatically incorrect. It should be "Simple linear layer model with bias" to be consistent with the "linear_nobias" description above.

Suggested change

"description": "Simple linear layer model bias",

"description": "Simple linear layer model with bias",

Copilot · 2026-02-05T18:42:20Z

backends/apple/metal/passes/decompose_linear_pass.py

+                    if hasattr(input_node, "meta") and "val" in input_node.meta:
+                        if len(input_node.meta["val"].shape) == 2:
+                            needs_unsqueeze = True


The pass assumes that all input nodes have metadata with a 'val' attribute to determine dimensionality. If metadata is missing or incomplete, the pass will not unsqueeze 2D inputs, which could result in the addmm code path being taken instead of matmul. This defeats the purpose of the pass. Consider adding a fallback or error handling when metadata is not available, or document this limitation clearly.

Suggested change

if hasattr(input_node, "meta") and "val" in input_node.meta:

if len(input_node.meta["val"].shape) == 2:

needs_unsqueeze = True

if hasattr(input_node, "meta"):

val_meta = input_node.meta.get("val", None)

if val_meta is not None and hasattr(val_meta, "shape"):

if len(val_meta.shape) == 2:

needs_unsqueeze = True

else:

raise RuntimeError(

"DecomposeLinearPass requires input_node.meta['val'] with a 'shape' "

"attribute to determine input dimensionality."

)

Copilot · 2026-02-05T18:42:20Z

backends/apple/metal/passes/decompose_linear_pass.py

+                        unsqueeze_op = exir_ops.edge.aten.unsqueeze.default
+                        squeeze_op = exir_ops.edge.aten.squeeze.dims


The pass uses incorrect operator names for edge dialect squeeze and unsqueeze operations. It should use exir_ops.edge.aten.squeeze_copy.dims instead of exir_ops.edge.aten.squeeze.dims, and exir_ops.edge.aten.unsqueeze_copy.default instead of exir_ops.edge.aten.unsqueeze.default. The edge dialect typically uses _copy variants of view operations.

Suggested change

unsqueeze_op = exir_ops.edge.aten.unsqueeze.default

squeeze_op = exir_ops.edge.aten.squeeze.dims

unsqueeze_op = exir_ops.edge.aten.unsqueeze_copy.default

squeeze_op = exir_ops.edge.aten.squeeze_copy.dims

[ghstack-poisoned]

manuelcandales added 6 commits January 30, 2026 19:25

Update

39db621

[ghstack-poisoned]

Update

0ed7c5c

[ghstack-poisoned]

Update

b4310cc

[ghstack-poisoned]

Update

94c823c

[ghstack-poisoned]

Update

31b6f45

[ghstack-poisoned]

Update

c68cc6b

[ghstack-poisoned]

manuelcandales requested review from cccclai and shoumikhin as code owners February 2, 2026 21:27

Update

bd7192f

[ghstack-poisoned]

Update

bcc8bda

[ghstack-poisoned]

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 2, 2026

Update

f166c50

[ghstack-poisoned]

This was referenced Feb 2, 2026

Metal backend: test modules #17076

Merged

Metal backend: remove parakeet linear decomp #17116

Open

Metal backend: support 4-bit linear #17117

Merged

Metal backend: enable quantization in test_modules #17118

Merged

manuelcandales requested review from larryliu0820 and mergennachin and removed request for cccclai and shoumikhin February 2, 2026 21:29

manuelcandales added 8 commits February 2, 2026 17:26

Update

0834659

[ghstack-poisoned]

Update

ed4dcee

[ghstack-poisoned]

Update

a058197

[ghstack-poisoned]

Update

7146282

[ghstack-poisoned]

Update

d3501af

[ghstack-poisoned]

Update

fe5be37

[ghstack-poisoned]

Update

a0e3469

[ghstack-poisoned]

Update

fcfa832

[ghstack-poisoned]

manuelcandales added 2 commits February 4, 2026 23:29

Update

c96a67f

[ghstack-poisoned]

Update

e81b589

[ghstack-poisoned]

mergennachin approved these changes Feb 5, 2026

View reviewed changes

Base automatically changed from gh/manuelcandales/151/head to main February 5, 2026 16:48

manuelcandales added 2 commits February 5, 2026 13:35

Update

0f2cddd

[ghstack-poisoned]

Update

96e72b1

[ghstack-poisoned]

Copilot AI review requested due to automatic review settings February 5, 2026 18:35

manuelcandales requested review from GregoryComer and lucylq as code owners February 5, 2026 18:35

manuelcandales changed the base branch from main to gh/manuelcandales/163/head February 5, 2026 18:35

Copilot started reviewing on behalf of manuelcandales February 5, 2026 18:36 View session

manuelcandales added 2 commits February 5, 2026 13:42

Update

8ff273f

[ghstack-poisoned]

Update

5b3ea00

[ghstack-poisoned]

Copilot AI reviewed Feb 5, 2026

View reviewed changes

manuelcandales added 5 commits February 5, 2026 14:09

Update

4316164

[ghstack-poisoned]

Update

6bab05d

[ghstack-poisoned]

Update

401af46

[ghstack-poisoned]

Update

957ba1f

[ghstack-poisoned]

Update

5b457db

[ghstack-poisoned]

manuelcandales mentioned this pull request Feb 5, 2026

Metal backend: fix linear_filter in test_modules #17253

Merged

manuelcandales added 8 commits February 5, 2026 15:05

Update

87f1529

[ghstack-poisoned]

Update

9ea88a9

[ghstack-poisoned]

Update

4fb7659

[ghstack-poisoned]

Update

cf89a2b

[ghstack-poisoned]

Update

56f91d6

[ghstack-poisoned]

Update

46e48be

[ghstack-poisoned]

Update

4962722

[ghstack-poisoned]

Update

65e572d

[ghstack-poisoned]

Base automatically changed from gh/manuelcandales/163/head to main February 5, 2026 22:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Metal backend: enable linear with bias#17115

Metal backend: enable linear with bias#17115
manuelcandales wants to merge 64 commits intomainfrom
gh/manuelcandales/152/head

manuelcandales commented Feb 2, 2026 •

edited

Loading

Uh oh!

manuelcandales commented Feb 2, 2026 •

edited

Loading

Uh oh!

pytorch-bot bot commented Feb 2, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Feb 5, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Feb 5, 2026

Uh oh!

Copilot AI Feb 5, 2026

Uh oh!

Copilot AI Feb 5, 2026

Uh oh!

Copilot AI Feb 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	"description": "Simple linear layer model bias",
	"description": "Simple linear layer model with bias",

-                    if hasattr(input_node, "meta") and "val" in input_node.meta:
-                        if len(input_node.meta["val"].shape) == 2:
-                            needs_unsqueeze = True
+                    if hasattr(input_node, "meta"):
+                        val_meta = input_node.meta.get("val", None)
+                        if val_meta is not None and hasattr(val_meta, "shape"):
+                            if len(val_meta.shape) == 2:
+                                needs_unsqueeze = True
+                        else:
+                            raise RuntimeError(
+                                "DecomposeLinearPass requires input_node.meta['val'] with a 'shape' "
+                                "attribute to determine input dimensionality."
+                            )

		unsqueeze_op = exir_ops.edge.aten.unsqueeze.default
		squeeze_op = exir_ops.edge.aten.squeeze.dims

Conversation

manuelcandales commented Feb 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

manuelcandales commented Feb 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Feb 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/17115

❌ 4 New Failures, 7 Cancelled Jobs, 3 Unrelated Failures

Uh oh!

github-actions bot commented Feb 5, 2026

This PR needs a release notes: label

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

manuelcandales commented Feb 2, 2026 •

edited

Loading

manuelcandales commented Feb 2, 2026 •

edited

Loading

pytorch-bot bot commented Feb 2, 2026 •

edited

Loading

This PR needs a `release notes:` label